

1 


TGACAGAACA 


TGTGCAACAA 


CAGCCCCGAG 


TATGGTTTGG 


TTTTGAGGAT 


51 


CTCGGGTGCT 


ATGGCCACCA 


TGGT :TGGAG 


TGGGGTGG3T 


GCTGGGTGTG 


101 


GCATGGCGAA 


GAATGGCACC 


GGCATCATGG 


GGATGTGTGT 


CATGTGGCCA 


151 


GAGCTGATCC 


ACATGAAG TG 


GATCATCCCA 


GTGGT :ATGG 


CTGGTATCAT 


201 


CACCATCTAT 


GGCCTAGTGG 


GGGCTGTCC^ 


GGGTG::AAG 


TGCGTGAATG 


2 51 


ATGACAACAG 


TGTGTATAGG 


AGTTTCCTGC 




TGGCCTGAGT 


3 01 


GGCCTGGCAG 


CCGGCTTTGC 


gatggtgat: 


GTGGGGGACA 


CTGGCAAGTG 


3 51 


TGGGAGTGGC 


CAGCAGCCCC 


GAGTATTTGT 


AGGGATGATA 


GTGATCCTCA 


4 01 


TCTTTGCCAA 


GGTGCTCATT 


GTGTGGAGAA 


AGGAGGGGGT 


GTCAAAACCC 


451 


ACCAGTCACA 


GAATACGATG 


TAAAGAO"^ 


^ : :t "GTGAT 


T :G 3GAACAA 


5 01 


ACAGCCTGAC 


ACGGATGTGC 


TGGGGAGGTG 


GGGG TGAGTA 


GTTGATCTTC 


551 


TAAGTGTAGA 


GTGTGGTGGT 


GTTGATCGTG 


tgttggg:ag 


gccttgg:c: 


601 


GTCGGGCGCC 


ATGGTGTGGA 


CATGTGAAGC 


ta: 





FEATURES : 



5'UTR: 1-9 

Start Godon: 10 

Stop Codon: 62 5 

3'UTR: 628 
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HOMOLOGOUS PROTEINS: 

Top BLAST Hi ts : 

gi ! 4S0H313 j ref |NP_001685 . 1 | ATPase, H+ transporting, lysosomal 
gi j 137477 | sp| P23956 |VATL_BOVIN VACUOLAR ATP SYNTHASE lb FX 1 PROT 
gi i^'^'UP Iprf | | 1713409A H ATPase 16F [Bos taurus] 

gi j 3024312 | Sp | 018882 | VATL_SHEEP VACUOLAR ATP SYNTHASE 16 FX' PRO 
g l ; 4 1 P 1 7 9 1 sp | QO 3 1 0 5 | VATLJTORMA VACUOLAR ATP SYNTHASE 1 6 YD PROT 
gi | ::■ 144 | ref |NP_033859 . 1 1 ATPase-like vacuolar proton channel 
gi.|679S4 pir||PXBOV6 H+ - transport ing ATPase (EC 3.6.1.35), vacu 
gijLj."4 78 sp | P2 3 3 30 | VATL_DROME VACUOLAR ATP SYNTHASE 16 rX'A PRO 
g l I s :* 3 4 4 1") 3 | sp | 01 6 1 1 0 | VATL_AEDAE VACUOLAR ATP SYNTHASE 16 YD PRO 
gi | 171 H09 5 j sp | P5 527 7 j VATL_HELVI VACUOLAR ATP SYNTHASE 16 YD PRO 
41 1 4 01 3 3 4 ' sp ! P 3 14 03 \ VATL^MANSE VACUOLAR ATP SYNTHASE 16 YD PRjT 
31 ; 10442628 I gb I AAG17394 . 1 j AF277150_1 (AF277150; V-ATPase 36 kD 
gi | ^294725 | gb |AAF50062 . 1 | (AE003544) CG7547 gene product jDroso 
gi j 24 SO-< 14 2 j sp j Q2 625 0 j VATL_NEPNO VACUOLAR ATP SYNTHASE 16 FD PRO 
31 ! 251V"4 I gb| AAB22509 . 1 I vacuolar HU) -ATPase proteolipid subun 
gi I 24 9 ■ 14 3 I sp j Q0 06 0 7 | VATL_CANTR VACUOLAR ATP SYNTHASE 16 FX) PRO 

E^LA; : rr_ro dbEST : 

gij"*336427 /dataset=dbest / taxon= 96 0 . 

ji j 635 9305 /dataset=dbest / taxon=9606 ... 

gij ."'13 4 2 24 /da taset =dbe st / taxon= 96 06 . . . 
gi ! 10:o.9:.14 /dataset = dbest /taxon=96. . . 

gi I 334 ''2 17 / jar aset = dbe st / taxon= 96 0 . . . 

gi!. J 152104 / dat aset = dbest / taxon= 96 06 . . . 

gi|93941ob / iatasot=dbest / taxon= 96 0 . . . 

EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

Exp ress 1 on in formation from BLAST dbEST hits: 

gi I 7336427 Human uterus 

gi|635 9r'05 Human fetal liver 

gi 19134224 Human brain 

gi j 102 i .9 1 14 Human lung 

gi I 93472 17 Human placenta 

gi| 9152 104 Human skin 

gi I 9894 156 Human ovary 

Exp ress on information fr om P CR -based tissue screening pa nels 

Human Bene marrow 

Human Brain 

Human Colon 

Human Fetal Brain 

Human r et al heart 

Human Fetal Kidney 

Human r ocal 1 iver 

Human Heart 

Human Y\ iney 

Human Liver 

Human Lung 

Human Pancreas 

Human P 1 acenta 

Human Prostate 

Human Skeletal Muscle 

Human Small Intestine 

Human Spleen 

Human Testis 



Score E 

181 5e-45 

180 8e-45 

180 le-44 

178 3e-44 

175 5e-43 

174 8e-43 

165 3e-40 

157 le-37 

156 2e-37 

153 le-3-5 

153 le-3-5 

151 Se-36 

151 7e-36 

150 2e-35 

150 2e-35 

144 9e-34 



Score E 

165 le-33 

165 le-33 

165 le-33 

165 le-33 

165 le-38 

161 2e-37 

155 le-35 
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1 MSNNS PEYAL VFTI SGAMAT MVSSGLGAAC GMAKNGTGIM AMSVMWPELI 

51 HMKSIIP\ r VM AGIITIVGLV AAV P PANS LN DDNSLYSSFL QLGAGLSGLA 

101 AGFAI VI VGD TGKCGTAQQP RLFVGMILIL I FAKVLILST KQPLSKPTSH 

151 PIRCKDHPSS FRNKOPDTHV LGSWPSWDL LSVQCPRVHR LLARPCPLPP 
2 01 KAVDI 



FEATURES : 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATIONN-glycosylat ion site 
35-38 NGTG 



[2] PDOC00005 PS00005 PKC_PHOS PHO_SI TEProt e in kinase C phosphorylation site 
Number of matches: 4 

I 111-113 TGK 

.": 13 9-141 ST?: 

; 149-151 SHR 

4 160-162 3FR 



[3] PDOG0 0006 PS00006 CK2_PHOSPHO_S I TECase in kinase II phosphorylation site 
Number of matches: 2 

78-81 3LND 
2 176-179 SWD 



[4] PDOC00008 PS00008 MYRI STYLN -myri s t oyla t ion site 
Number of matches: 8 



1 16-21 GAMATM 

2 25-30 GLGAAC 

3 27-32 GAACGM 

4 31-36 GMAKNG 

5 68-73 GLVAAV 

6 93-98 GAGL3G 

7 98-103 GLAAGF 

8 172-177 GSWPSV 
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Membrane spanning structure and domains: 



Hel ix 


Begin 


End 


Score 


Certainty 


1 


14 


34 


1 . 389 


Certain 


2 


37 


57 


0.733 


Putative 


3 


60 


80 


2 . 0 3 C) 


Certain 


4 


95 


115 


1 . 775 


Certain 


5 


127 


147 


1 .699 


Certain 



BLAST Alignment to Top Hit: 

>gi ! 45023 13 | ref | NP_001685 . 1 I ATPase, H+ transporting, lysosomal 
(vacuolar proton pump) 16kD 

>gi | 137479 | sp ! P27449 | VATL_HUMAN VACUOLAR ATP SYNTHASE 16 

KD PROTEOLIPID SUBUNIT >gi | 1073 94 | pir | | A3 93 67 

H+ -transport ing ATPase (EC 3.6.1.35) chain PKD1 - human 

>gi | 189676 | gb | AAA60039 . 1 | (M62762) vacuolar H+ ATPase 

proton channel subunit [Homo sapiens] 

Length = 155 

Score - 181 bits (455), Expect = 5e-45 

Identities = 110/153 (71%), Positives = 114/153 (73%), Gaps = 14/153 (9%) 



Query : 


2 


SNNSPEYALVFTISGAI^TMV'SSGIj^AACGMAOTGTGIMAMSVMV/PELIHMKSIIPvl^ 


61 






S + PEYA F + GA A MV S LGAA G AK+GTGI AMSVM PE I MKSIIPWMA 




Sbjct : 


4 


SKSGPEYASFFAVMGASAAMVFSALGAAYGTAKSGTGI AAMSVMRPEQI - MUSI I PWMA 


62 


Qiue ry : 


62 


GIITIYGLVAAVPPANSLNDDNSLYSSFLQLGA GLSGLAAGFAI VI VGDTGKCGTA 


117 






GII IYGLV AV ANSLNDD SLY SFLQLGA GLSGLAAGFAI IVGD G GTA 




Sbjct : 


63 


GI IAI YGLWAVLIANSLNDDI SLYKSFLQLGAGLSVGLSGLAAGFAIGIVGDAGVRGTA 


122 


Query : 


118 


QQPRLFVGMILILI FAKV LILSTK 141 








QQPRLFVGMILILIFA+V LILSTK 




Sbjct : 


123 


QQPRLFVGMI L I L I FAEVLGLYGLI VALILSTK 15 5 





Hmmer search results (Pfam) : 

Model Description S cor e E-value N 

PF00137 ATP synthase subunit C 14.8 0.028 2 

Parsed for domains: 

Mo de 1_ Domain seq-f seq-t _ hmm -_f h mm- 1 score E-value 

PF00"l37 1/2 12 73 V/ 1 65 [. 7.6 2.4 

PF00137 2/2 89 133 .. 1 53 [ . 14.6 0.031 
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1 GCTGTGGGGC CAGGAAAAGG AGAGAAGGTG AAACCCCCGT CAGTCCCTCA 

SI CAATCAGCAC GTGGAAATCT AGAAATGGAG GAGAGGCCTG GACTCATGGT 

101 GGAATCCAGA ATGAAAGAGG TGGACGACTG AATGAGCAGA AGGAGGCAAG 

151 GACGAGAGGC TTGGGGGTGA CATTTCTTGG AAGTGGCCTG GAGCTGGCAG 

201 ATGAGAACTC TGGCTACCCG TCCTCATTCC ACTAACAGTA GCTCCTCTAA 

2 51 CGACATGCCC CTTCCCTCTG TACCCCGCTC CGCATGCGGC AAGTAGTTCC 

301 gggacgcgac ccttccccct gtacgc:gct gcg :atgggg CCAGTAGTTC 

3 51 CCGGACGCGC CCCTTCCCTC TGTACCCCGC TCCGCATGCG GCAAGTAGTT 

4 01 CCCGGACGCG CCCCTTCCCT CTGTACC 3GG :TOCGGATGG GGAAAGTAGT 

4 51 TCCTACGGTG TTGGTTTTGC ATGTAGATGA AACCCTTTGA GGGGTAAAGG 
501 TTTTTTTTTT AAGTACTTTA GCAAATGCAA ACTGTTATTA TCAATATTAG 

5 51 CCAGCATCTT TTTTTTTTTT TTTTTTTTTT TTTTTGAGAT GGAGTTTCGC 
0 01 TCTTGTCACC CAGGGTGGAG TGCAATGGCA AAATCTAGGC TCACTGGAAC 
0 51 CTCCGCCTCC CAAGTTCAAG CGATT2TC2T G2CTTAGCCT CCCAGGTAGC 
701 TGGGATTACA GGCGTGTGCA ACCACACCCA GCTAATTTTT GTATTTTTAG 
751 TAGAGACAGG GTTTCACCAT GTTGGC 2AGG CTGGTCTCGA ACTCCTGACC 
3 01 TCATGTGATC CATCCGCCTC AGCCTCC 2AA AGTGGTGGGA TTACGTAGCC 
351 AGTGTGTTTG TTAAGTGCCT GTCAAATAAT GCTCCTGGTT TATAAGTGGG 
■?01 CCTGGCTCTA CCTTCTGGGT GCTCAGACAC CAACACAGAG AGAAGAGAAT 
0 51 TAACATCCTG AGAAGTTACA TATGCTAAAA TATAAAGAGT AAGATTGTGA 

10 01 GGAAAGTGGA GGGGAAGCAG GTAGGTTAG 3 AAAAGGTATC CTGAGTTTT : 

l;-51 TGCTGACCGA TGAGTGATAA TTCTTGAATT TCGGTGCTGG AAAGGT 2 CAT 

13.01 TAAGCATTCC AGGAGATTCT A3GGA3CTTC CAGAATGGTA GAAGAAGTGG 

1151 AACCATAAAG CCTGGGGAAG GGATGGAAGT CCTTGGGAAA GAAGCACTAA 

12 01 A2AGCCCAGT GGAGACAAGG AAGGACT 3GT CTCTCCTGTG CTTCGAGCCC 

12 51 A3CAATGATT ATTCACTGAG ATATGCCCCG GCAGGTCCTG CTGCTA 3AGC 

1301 GAGTGGTGTT CCCAGACGCA GGGAAGGTGC CATCCTACCC CTGACAGGAA 

1>51 AGAGGGGAGG AGGTGGGGCT GCCCCGGGTG C 2TGGTGTTG GGAGGGGCCG 

14 01 GG 3GGAATCC CGGGTGTTGG GAGGACAAGG GAGAGTGAGG TAGCTGTGAG 

14 51 GCTAGGGGAG AAGACCTCTC TAGTTTGGGA GAGACCCCTC CTTTCCTA3C 
i"'.ti t2:ttgtact TGCAAAAAGG CAGGCTTCCT GCTGTTACTA ACCATACCA 3 

15 51 GAGTGAGTAT ACAGCAGCCA GAAAGATTGT GAGAAACCTG TGATAGAG.AA 
10 01 AAAGAGATGC GGAAGGGGGA GAAGAG.AATT TGATAGGAGA CT A 3GG AAA 3 
10 51 AGAAT 3 3GAA CTTGTGGTCT AAAGAGGGAA GGAAGTGTGG CCAACATGGT 

1 V .1 GAAATCCCAT CTCTATTACA AATACAAAAA TTAGCTGGGC ATGGTAGT 3C 
1?51 ATGTCTGTAA TCCCAGCTAC TCAGGAGGGT AAGGGATGAG AATGAGTTGA 
H"l GACTGGGAGG CGGAGGTTGC AGTGAGCCGA GACTGCACCA CTGCAGTC TA 
L3 51 G3CTGGGCAA CAGAGGAAGA CCTCGTCTCA AAAAAAAAAA AAAATTAAAT 

1. V.a TTAAATTAAA AAAAAATAAA CAGGGAAC 2 A ACAAGAGCTG GCAGAACAGA 
1^51 ATAAAGTCTC AAGGGAAATA ACTCCCTTGC GTTGGAAGAA CAAGGCTG'.:C 
2 : ; "J 1 A3C3T3TTGG AGCGTGTGTT TATGGGGTAG CAGTTCAAAG GACAGTGAGC 
2:)51 GTGAGGTGGG CTGGGAGGCC GTGCGGGGTG CCAGATGAAA ACAATAG3CC 
2:01 T3TTTTCCTG AGCTCTTGCT GTAATCCA3A AGGCCCACAC AGAGAGGAAG 

2 121 AGGGGGGG AA AGGGAGTGGC TATACCCAGT GGGGGAGGGG ATATTTAGGG 
2 2 01 r 2 2CAT2AAAT TCATCAGCTC CGTTAAAGAG ACCCGAAAAC ACCAACAAT 2 

2. : : -l TAAGTGTTAA AATAGTGACT GCTATGCAAA TGGAGGTTTA AAACCTATG 2 
2-- '1 2 [TAG 2GGGA GTACCACCAG ATTACTAAC 2 GTAAACGCCA TCTGTAGGA 2 
2-: ; .l ATATTCT 3AA GCCACCACAG GGGAAGGGAT AAGGGCCTGA GAGACAAAGG 
2 4 '1 A-3A3AT3GGG TGTCCCCAAC AATTTAAGTT AAGTTCCACA AGGATA 2 A jT 
24 M ACTGG2AGAG ATTTGGAAGT AGGGGGAAGT ATTGTGAGAG AAGGGTGGTG 
2 2.il rXTTA 3GCAC CCTTCAATTA GGAGTAGCTA AAGG 2TGTGT GTGTGTGT 3T 
2 2 : 1 3T3TGTGTAT AAGAAAAGAA ATAGGAGGGT GTGTGTGTGG TAAGAAAGA 3 





: 1 


2 a r 


2TTGGCT 


GGGCGCGGTG 


GCT2ACCCCT 


ATAATCGCAG 


CACTTTGG 


2A 




: 1 


T 3 2 


2AAGGGT 


GGCGGATTGC 


CTGAGCTCAG 


GAGTTTGAGA 


CCATACGG 


jG 


-1 ; 


■'1 


CAA 


2ATGGTG 


AAACCCCATC 


TCTACTAAAA 


ATACAAAAAA 


TTAGCT3G 




■ 1 / 




at 


3TGGTGC 


GTGCCTATAG 


TTCCAGCTAC 


TCGGGAGGCT 


GAGGCAT 3 


\3 


2 3 




A AT 


3 3CTTGA 


GCCCTGGAGG 


CAGAGGTTGA 


AGTGAGCTGA 


GATCGCAC 


: a 


2 3 


M 


'i r 3 


2 ATT CCA 


GCTTGGGCTA 


CAGAGTGACA 


CTCCATCTCA 


AAAAAAAAAA 


2 ) 




AAA 


AAAAAAA 


AAAAACCAGC 


ATCTTTGCTG 


CCACTAGTCC 


ACTGTCTT 


L 3 


2 ) 


1 


CA 2 


rCACTCT 


CTGCCATGCC 


CATCCTTGTC 


CCCCTCCCCA 


CTCACAGA 


A 


3 0 


1 1 


T2,r 


2 3AACAA 


CAGCCCCGAG 


TATGCTTTGG 


TTTTCACCAT 


CTCGGGTG 


:t 


iO 


5 1 


ATG 


GCCACCA 


TGGTCTCCAG 


TGGCCTGGGT 


GCTGCCTGTG 


GCATGGCC 


AA 


31 


j 1 


OAA 


T3GCACC 


GGCATCATGG 


CCATGTCTGT 


CATGTGGCCA 


GAGCTGAT 
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3151 acatgaagtc catcatccca gtggtcatgg ctggtatcat caccatctat 

1201 ggcctagtgg cggctgtccc ccctggcaac tccctgaatg atgacaacag 

3 2 51 tctctatagc agtttcctcc agctgggcgc tggcctgagt gggctggcag 

3301 ccggctttgc catcgtcatc gtgggggaca ctggcaagtg tggcactgcc 

:% 5 1 cagcagcccc gactatttgt aggcatgata ctgatgctga tgtttggcaa 

uoi ggtgctcatt ctctccacaa agcagcccct ctcaaaaccg actagtcaca 

34 51 gaatacgatg taaagaccac ccctcctgat tccggaacaa acagcctgac 

3 5 01 acgcatgtgg tgggcagctg gccctcagta gttgatgttc taagtgtaca 

3 51 gtgtcctggt gttgatggtg tgttggcgag gccttgcccc ctcccgcccc 

3 601 atggtgtgga gatctgaagg tact cat tag cgatggaggt ccccgaccag 

3b51 tgaggactca ggcccctgga tgccccaccc atgtcggttg agtactctat 

3'' 01 GTATAAGGAT GAATTAGAGT TGTCATTTTC TGTTCATTAG ATATTTATAA 

3'/ 51 A3ATTTGGCC TGTGGATAGG GCTGTGGAGG AGCCCTGATC TCCCACCTAT 

3rt01 GTGTCACGTC ATGGAGGTTG GGATTGGGGA GGCTGGTTGG ATGGAACCAC 

3 351 C3TCTCCAGC CCGGGGTGGC AGGCCCTGTG GGGGAGCTGT GTCTGATAAA 

3'.-*01 GTTGTCAGAT GTCGGGGGAG GGAAAGAAAA AAAAAAGAGA GTGTGAGTAC 

3 051 GTAAGAGAGA GAAAAGGGGA GTGGGTGTGT GAGCTGGAGA CAGGGAAGT 3 
4 '101 3CAGGAAAAG TGTGATAAGA TCACCTCCTT GGTAGGGAAG GAG AG A TACT 

4 051 GGAGAGAGCC CGTCAAGGAC CCAGAGGGTA AGTA 3AGCGC GAGATGCTTG 
4 101 OCTTTCTCAA TGGGAGGTGG CCTCCCAGGC CTGAAGAAGT CTGCATTTAC 
4 151 OCCAGAGCCA ACTAGGAAGC AGGTAGACAG CATCATCCCG ACTTATACC 2 
43 01 CAAGGTGCTT GGGGTGAATG GCAGGCCCAA AGCGAAAGCA T GAGA GAG AT 

42 51 TAAATGTTCC TATGGCGAGA GAAGGAGAA3 GGGTCACCAG CATCTCTCCA 
4 301 CTGAGCAAAT GAAAGGAAGA GAGAAGGCAG GCTGATACCC TCATGAATTT 

43 51 COTACTGTCC ATGATATACC ACCATCAACT GGACTTTTTT TTTTTTTTT 3 

44 01 A 3ATAGAGTG TCGCTTTTGT CAGCCAGGCT GGAGTGCAGT GGCATGATCT 
44 51 CAGCTCACTG CAACTTCGAT CTCCCAGGTT CAAGTGATTC TCCCGCCT 2 A 
4 501 GG :tcctgag TAGCTGGGAT TAGAGGTGGG TGCTACCACA TCCAGGTGAT 
4 551 TTTTTTTGTA TTTTTAGTAG AGATGGGGTT TCTTTCTTTT TTTTTTTTTT 
4 -'01 TTTT3AGACG GAGTCTTGCT CTGTCGCCCA GGCTGGAGTG CAGTGGCGCG 
4o51 ATCTCGGCTC ACTGCAACGT CCGGGTCCGA GGTTCACGCC ATTGTCGTGC 
4/01 CTCAGCCTCC CGAGTAGCTG GGAGTACAGG CACCTGCCAG CAGACTGGGG 
4 7 5 1 TAATTTTTTG TATGTTTAGT AGATATGGGG TTTGACTGCT GTCTCAACCT 
4 301 TCT 3ACCTCA TGATCCGCCC GCGTCGGCCT GCGAAAGTGG TGGGATTAGA 
4351 G3CATGAGCC ACTGTGCCCG GGCTTTTTTT TTTTTTTTGA GATGGAGTGT 

4 001 CGCTCTGTCG CGCAGGCTGG AGTGCAATGC GAGAATCTGA GCTCACTGCA 
4^51 agct:cacgt CCGAGGTTCA CGCCATTCTC CTGCCTCAGC CTCGTGAGTA 
50 01 GCTGGGACTA CAGGCGCCCG CCACCACGCC CAGCTAATTT TTTGTATTTT 
5u51 TACT AG AG AG GGGGTTTCAC CTTGTTAGCG AGGATGGTGT TGATCTCCTG 
5101 ACCTGGTGAT CGACCTGCGT CAGCCTCCCA AAGTGCTGGG ATTACAGGTG 

5 151 TGAGCCAGCA TGCCTGGCGT TTTTTTTTTT TTTAAGACAG GAGTGTGGTG 
5 3 01 GGACAATGTC AGCTCACTGC AACCTCCCCT TCTAGGTTCA AGCAATTCTC 

53 51 CTGCCTCAGC TTCCTAAGTA TAGTAATAGC TGGGACTATA GGCGCCCAC 2 
5? 01 A 3CAC 3CCCG GCTAATCTTT TGTATTTTTA GTAGAGATGG GGTTTCACCA 
5:. 51 TGTTGGCGAG GCTGGTCTCG AATTGCTGAC CTCAAGTGAT CTGCCCACCT 

54 01 GG 3 :CTCCCA AAGTGCTGGG ACTATAGGCG GGAGCGACCG CGCCCAGCCT 
54 51 GGArrCTTTT TAATGAAGCC TTCAAAAAAA CTCCTTTTCT CAGCGCTTCT 
5 501 TACTC rCTGA AACAGACTCT CGACTCTGCT AACCCTGCCT CTCACACTGT 
5 l -i51 3GAACTCAA3 CGGATCTTTT TATTCTGAAT CCACAACGTG AAGTACTTGT 
50 01 C 3TC r jTCTA TCGATGGCTA CCTGTGTTTT GAAGTGTTTT TATGGGAATG 

5 0 5 1 aagcactgga ggggaggaaa tcagggcagt tctagaagta gaaggaaggc 
5 701 gaagaaagga ggaaaaatat ttatgtgatg ggaggaaagg cagtttataa 
5 751 at3actcatg gatctctatg ccagagggat gtgtgagaca cacgcatg 3a 
5 301 cacacacagt gacttgcagg tacatgcaga ggcagaaaca agtcaggaca 
5351 tgacacatac atgaatacac ataccattct catcagaaac cagtcagagc 
5 001 agaggggccc tgcctggagc aaggagactg gaatttattc ccctcctcct 

5 05 1 ct3aaagggt aattttgctg cctccatgtc taggttcccc acagatctgg 

6 001 ct3cctcaga caggggccct ggtctggtgg ctggactcag cctggaggtc 
5051 tt:a::a3atg gaggcgtata agaggtggca gctgacacct ggagggagct 
6 101 g3atgaaagc aggcagtgca gagtagagaa agccaggtgg tgggggaggg 
6 151 act 3 agog ag aagaggggag cagattcaag cagccttgcg ctggttctaa 
6 301 aat 3gccaca gcaaggcaac ggacagatgg tccctttctg atgctgagcc 
62 51 0 3 33aagt3g ggaaagggaa aaggaaaaaa taaacaccat cacagtcaga 



FIGURE 





6 3 01 AATTTAAAAA TAAACTGAAA AACCTAAAAA ATAAACCGT 

FEATURES : 

Start: 3000 
Exon : 3000-3614 
Stop: 3 6 1 C S 



CHROMOSOME MAP POSITION: 

Bac accession number: AC005973 

Chromosome 17 



ALLELIC VARIANTS (SNPs) : 

DNA 



Pos it ion 


Ma] or 


Minor 


Domain 




55 9 




A T 


3eyond 


OR? (5 1 ) 


3638 




C 


Beyond 


OR? (3 ' ) 


5446 




T A 


Beyond 


0RF(3 ' ) 


5808 


A 


G 


Beyond 


ORF ( 3 ' ) 


5892 


A 


c 


3eyond 


OR? (3 ' ) 


6071 


A 


G 


Beyond 


ORF (3 ' ) 



Context : 



DNA 

^os_it_ion 

5 5 9 CCCTTCCCTCTGTACCCCGCT 3CGCAT 3 03GCAAGTAGTTCCCGGACGO3ACCCTTC0CC 

rTGTACCCCGCTCCGCATGGGGCCAGTAGTTCCCGGACGCGCCCCTTCGCTCTGTACGCC 
■j rTGCGCATGCGGCAAGTAGTTCCCGGACGCGCCCCTTCCCTCTGTACCCGGCTCCGCAT 
GCGGAAAGTAGTTCCTACGGTGTTGGTTTTGCATGTAGATGAAACCCTTTGAGGGGTAAA 
GGTTTTTTTTTTAAGTACTTTAGCAAATGCAAACTGTTATTATCAATATTAGCCAGCATC 
[- ,A,T] 

TTTTTTTTTTTTTTTTTTTTTTTTTTGAGATGGAGTTTCGCTCTTGTCACCCAGGCTGGA 
GTGCAATGGCAAA-ATCTAGGCTGACTGGA.ACCTCCGCCTCCCAAGTTCA.AGCGATTGTGC 
TGCCTCAGCCTCCCAGGTAGCTGGGATTACAGGCGTGTGCAACCACACCCAGCTAATTTT 
TGTATTTTTAGTAGAijACAGGGTTTGA-::GATGTTGGCCAGGCTGGTCTCGAACTCGTGAC 
CTCATGTGATCCATCCGCCTCAGCCTCCCAAAGTGCTGGGATTACGTAGCCAGTGTCTTT 

3 63 8 GTGTGGCACTGCCCAGCAGCGCCGACTATTTGTAGGCATGATACTGATCCTCATCTTTGC 
CAAGGTGCTCATTCTCTCCACAAAGCAGCCCCTCTCAAAACCCACCAGTGACAGA.ATACG 
ATGTAAAGACCACCCCTCCTCATTCCGGAACAAACAGCCTGACACGCATGTGCTGGGCAG 
CTGGCC':TCAGTAGTTGATCTTGTAAGTGTACAGTGTCCTCGTGTTCATCGTCTGTTGGG 
CAGGCCTTGCCCCCTCCCGGC 3CATGCTGTGGACATCTGAACCTACTCATCACCCATCCA 

GTCCCCGACCAGTGAGGACTCAGGCCCCTGGATGCCCCACCCATCTCCCTTGAGTACTCT 
ATGTATAAGGATGAATTAGAGTTGTCATTTTCTCTTCATTAGATATTTATAAAGATTTGG 
CGTGTGCATACCCCTGTGGAGCAGCCC rCATCTCCCACCTATCTGTCACGTCAT 3GAGGT 
TCCCATTGCGGAGGCTCCTTGGATGGAAGCACCCTCTCCAGCCCGCGCTGCCAGGCCCTG 
TGCGGCAGCTGTGTCTGATAAAGTTCTCAGATGTCGGGGGAGGGAAAGAAAAAAAAAAGA 



54 4 6 AGGTGTGAGCCACCATGCCTGGGGTT rTTTTTTTTTTTAAGACAGGAGTGTGGTGGCACA 

ATCTCAGCTCACTGCAACCTCGGCTTCTAGGTTCAAGCAATTCTCCTGCCTCAGCTTCGT 
AAGTATAGTAATAGCTGGGACTATAGGCGCCCACCACCACGCCCGGCTAATCTTTT 3TAT 
TTTTAGTAGAGATGGGGTTTCAGCATGTTGGCCAGGCTGGTCTCGAATTGCTGA 3 CT 3AA 
GTGATCTGCCCACCTGGGCCTCCCAAAGTGCTGGGACTATAGGCGGGAGCCACCG 3GC zc 
[C,T,A] 

GCCTGGACTCTTTTTAATGAAGCCT rCAAAAAAACTCCTTTTCTCAGCGCTTCTTACT 3T 
CTGAAACAGACTCTCCACTCTG ITAAC 2C TGCCTCTCACACTGTGGAACTCAACCGGATC 
TTTTTATTCTGAATCCACAACGTG.AAG rACTTGTCCTCTGTCTATCGATGGCTACOT jTG 
TTTTGAAGTGTTTTTATGGGAATGAAGCACTGGAGGGGAGGAAATCAGGCCAGTT 3TA 3A 
AGTAGAAGGAAGGCGAAGAAACCA jGAAAAATATTTATGTGATGGGAGGAAAGGCAGTTT 
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5 8 0 8 TGAAACAGACTCTCCACTCTGCTAACCCTGCCTCTCACACTGTGGAACTCAACCGGATOT 
TTTTATTCTGAATCCACAACGTGAAGTACTTGTCCTCTGTCTATCGATGGGTAGGTGTGT 
TTTGAAGTGTTTTTATGGi^AATGAAGGACTGGAGG(3GAGGAAATCAGGC':AGTT':TAG,\A 
GTAGAAGGA.^GGCGAAGAAACCAGGAAAAATATTTATGTGATGGGAGGAAAjGCAGTTTA 

taaatca-:t-:atggatctctatgccagagggatgtgtgagacacacgcatggagagacac 
[a,g] 

ctgacttgcaggtacatggagaggcagaaacaagtcaggacatgacacatacatg.aatac 

ACATAi:CATTGTGATCAGAA\CCA5TCAGA(3CAGAGGGGCCCTGCCTGGAGCAA5GAi3A': 
TGGAATTTATTCCCCTCCTCCTCTCAAAGGGTAATTTTGCTGCCTCCATGTCTAGGTTCC 
CCACAGATCTGGCTGCCTCAGACAGGGGCGCTGGTCTGGTGGCTGGACTGAGCCTGGAGG 

tcttcacagatggaggcctataagaggtggcagctgacacctggagggag:tggatG/\aa 

5 8 9 2 agtacttgtcctctgtctatcgatggctacctgtgttttgaagtgtttttatggga^tga 
agcactggaggggagga.a^tcagG':cagttctagaagtagaaggaaggcga^gaa.^gcag 
gaaaaatatttatgtgatgggaggaa^gggagtttataaatcactcatggatctctatgc 
cagagggatgtgtgagacacacgcatgcagacacacactgacttgca^gtacatggagag 

GCAGAAACAAGTCAGGACATGACACATACATGAATACACATACCATTCTCATCA 3AAAC z 

gtcagagcagaggggccctgcctggagcaaggagactggaatttattccgctcctcctct 
caaagggtaattttgctgcctcgatgtctaggttccccacagatctggctgcctgagaca 
ggggccctggtctggtggctggactcagcctggaggtcttcacagatggagggctataag 
aggtggcagctgacacctggagggagctggatgaaagcaggcagtgcagagtagagaaag 
ccaggtggtgggggagggagtgagggagaagaggggaccagattcaagcaggcttgggct 

60 71 ccagagggatgtgtgagacacacggatgca :acacacactgacttgcaggtacatgcaga 

ggcagaaacaagtcaggacatgacacatacat3aatacacataccattgtcat:ag.aa.^c 
cagtcagagcagaggggcgctgcctggagcaa3gagactggaatttattcgcctcct:ct 

GTCAAAGGGTAATTTTGCTGCCTCCATGTCTAGGTTCCCCACAGATCT'SGCTGGCTCAGA 
CAGGGGCCCTGGTCTGGTGGCTGGACTCA2C rTGGAGGTGTTCACAGATGGAGGCCTATA 
[A, G] 

GAGGTGGCAGCTGACACCTGGAGGGAGrTGGATGAAAGCAGGCAGTGCAGAGTAGAGAAA 
GCCAGGTGGTGGGGGAGGGAGTGAGGGAGAAGAGGGGACCAGATTCAAGCAGCCTTGCGC 
TGGTTCTAAAATGGCCACAGCAAGGCAACGGACAGATGGTCCCTTTCTGATGCTGAGCCG 
GGGAAGTGGGGAAAGGGAAAAGGAAAAAATAAAGACCATCACAGTCAGAAATTTAAAAAT 
AAACTGAAAAACCTAAAAAATAAACCGT 
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